Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Semi-Supervised Morphosyntactic Classification of Old Icelandic

Identifieur interne : 000086 ( Main/Exploration ); précédent : 000085; suivant : 000087

Semi-Supervised Morphosyntactic Classification of Old Icelandic

Auteurs : Kryztof Urban [États-Unis] ; Timothy R. Tangherlini [États-Unis] ; Aurelijus Vij Nas [République populaire de Chine] ; Peter M. Broadwell [États-Unis]

Source :

RBID : PMC:4100772

Abstract

We present IceMorph, a semi-supervised morphosyntactic analyzer of Old Icelandic. In addition to machine-read corpora and dictionaries, it applies a small set of declension prototypes to map corpus words to dictionary entries. A web-based GUI allows expert users to modify and augment data through an online process. A machine learning module incorporates prototype data, edit-distance metrics, and expert feedback to continuously update part-of-speech and morphosyntactic classification. An advantage of the analyzer is its ability to achieve competitive classification accuracy with minimum training data.


Url:
DOI: 10.1371/journal.pone.0102366
PubMed: 25029462
PubMed Central: 4100772


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Semi-Supervised Morphosyntactic Classification of Old Icelandic</title>
<author>
<name sortKey="Urban, Kryztof" sort="Urban, Kryztof" uniqKey="Urban K" first="Kryztof" last="Urban">Kryztof Urban</name>
<affiliation wicri:level="2">
<nlm:aff id="aff1">
<addr-line>The Scandinavian Section, University of California Los Angeles, Los Angeles, California, United States of America</addr-line>
</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>The Scandinavian Section, University of California Los Angeles, Los Angeles, California</wicri:regionArea>
<placeName>
<region type="state">Californie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Tangherlini, Timothy R" sort="Tangherlini, Timothy R" uniqKey="Tangherlini T" first="Timothy R." last="Tangherlini">Timothy R. Tangherlini</name>
<affiliation wicri:level="2">
<nlm:aff id="aff1">
<addr-line>The Scandinavian Section, University of California Los Angeles, Los Angeles, California, United States of America</addr-line>
</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>The Scandinavian Section, University of California Los Angeles, Los Angeles, California</wicri:regionArea>
<placeName>
<region type="state">Californie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Vij Nas, Aurelijus" sort="Vij Nas, Aurelijus" uniqKey="Vij Nas A" first="Aurelijus" last="Vij Nas">Aurelijus Vij Nas</name>
<affiliation wicri:level="1">
<nlm:aff id="aff2">
<addr-line>Department of English, National Kaohsiung Normal University, Kaohsiung, Republic of China</addr-line>
</nlm:aff>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Department of English, National Kaohsiung Normal University, Kaohsiung</wicri:regionArea>
<wicri:noRegion>Kaohsiung</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Broadwell, Peter M" sort="Broadwell, Peter M" uniqKey="Broadwell P" first="Peter M." last="Broadwell">Peter M. Broadwell</name>
<affiliation wicri:level="2">
<nlm:aff id="aff3">
<addr-line>The University Library, University of California Los Angeles, Los Angeles, California, United States of America</addr-line>
</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>The University Library, University of California Los Angeles, Los Angeles, California</wicri:regionArea>
<placeName>
<region type="state">Californie</region>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">25029462</idno>
<idno type="pmc">4100772</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4100772</idno>
<idno type="RBID">PMC:4100772</idno>
<idno type="doi">10.1371/journal.pone.0102366</idno>
<date when="2014">2014</date>
<idno type="wicri:Area/Pmc/Corpus">000180</idno>
<idno type="wicri:Area/Pmc/Curation">000180</idno>
<idno type="wicri:Area/Pmc/Checkpoint">000039</idno>
<idno type="wicri:Area/Ncbi/Merge">000201</idno>
<idno type="wicri:Area/Ncbi/Curation">000201</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">000201</idno>
<idno type="wicri:Area/Main/Merge">000087</idno>
<idno type="wicri:Area/Main/Curation">000086</idno>
<idno type="wicri:Area/Main/Exploration">000086</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Semi-Supervised Morphosyntactic Classification of Old Icelandic</title>
<author>
<name sortKey="Urban, Kryztof" sort="Urban, Kryztof" uniqKey="Urban K" first="Kryztof" last="Urban">Kryztof Urban</name>
<affiliation wicri:level="2">
<nlm:aff id="aff1">
<addr-line>The Scandinavian Section, University of California Los Angeles, Los Angeles, California, United States of America</addr-line>
</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>The Scandinavian Section, University of California Los Angeles, Los Angeles, California</wicri:regionArea>
<placeName>
<region type="state">Californie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Tangherlini, Timothy R" sort="Tangherlini, Timothy R" uniqKey="Tangherlini T" first="Timothy R." last="Tangherlini">Timothy R. Tangherlini</name>
<affiliation wicri:level="2">
<nlm:aff id="aff1">
<addr-line>The Scandinavian Section, University of California Los Angeles, Los Angeles, California, United States of America</addr-line>
</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>The Scandinavian Section, University of California Los Angeles, Los Angeles, California</wicri:regionArea>
<placeName>
<region type="state">Californie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Vij Nas, Aurelijus" sort="Vij Nas, Aurelijus" uniqKey="Vij Nas A" first="Aurelijus" last="Vij Nas">Aurelijus Vij Nas</name>
<affiliation wicri:level="1">
<nlm:aff id="aff2">
<addr-line>Department of English, National Kaohsiung Normal University, Kaohsiung, Republic of China</addr-line>
</nlm:aff>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Department of English, National Kaohsiung Normal University, Kaohsiung</wicri:regionArea>
<wicri:noRegion>Kaohsiung</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Broadwell, Peter M" sort="Broadwell, Peter M" uniqKey="Broadwell P" first="Peter M." last="Broadwell">Peter M. Broadwell</name>
<affiliation wicri:level="2">
<nlm:aff id="aff3">
<addr-line>The University Library, University of California Los Angeles, Los Angeles, California, United States of America</addr-line>
</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>The University Library, University of California Los Angeles, Los Angeles, California</wicri:regionArea>
<placeName>
<region type="state">Californie</region>
</placeName>
</affiliation>
</author>
</analytic>
<series>
<title level="j">PLoS ONE</title>
<idno type="eISSN">1932-6203</idno>
<imprint>
<date when="2014">2014</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<p>We present IceMorph, a semi-supervised morphosyntactic analyzer of Old Icelandic. In addition to machine-read corpora and dictionaries, it applies a small set of declension prototypes to map corpus words to dictionary entries. A web-based GUI allows expert users to modify and augment data through an online process. A machine learning module incorporates prototype data, edit-distance metrics, and expert feedback to continuously update part-of-speech and morphosyntactic classification. An advantage of the analyzer is its ability to achieve competitive classification accuracy with minimum training data.</p>
</div>
</front>
<back>
<div1 type="bibliography">
<listBibl>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Cucerzan, S" uniqKey="Cucerzan S">S Cucerzan</name>
</author>
<author>
<name sortKey="Yarowsky, D" uniqKey="Yarowsky D">D Yarowsky</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Forsberg, M" uniqKey="Forsberg M">M Forsberg</name>
</author>
<author>
<name sortKey="Ranta, A" uniqKey="Ranta A">A Ranta</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ranta, A" uniqKey="Ranta A">A Ranta</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Wagner, Ra" uniqKey="Wagner R">RA Wagner</name>
</author>
<author>
<name sortKey="Fischer, Mj" uniqKey="Fischer M">MJ Fischer</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Goldwater, S" uniqKey="Goldwater S">S Goldwater</name>
</author>
<author>
<name sortKey="Griffiths, Tl" uniqKey="Griffiths T">TL Griffiths</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Borin, L" uniqKey="Borin L">L Borin</name>
</author>
<author>
<name sortKey="Forsberg, M" uniqKey="Forsberg M">M Forsberg</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Loftsson, H" uniqKey="Loftsson H">H Loftsson</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Toutanova, K" uniqKey="Toutanova K">K Toutanova</name>
</author>
<author>
<name sortKey="Johnson, M" uniqKey="Johnson M">M Johnson</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Lafferty, J" uniqKey="Lafferty J">J Lafferty</name>
</author>
<author>
<name sortKey="Mccallum, A" uniqKey="Mccallum A">A McCallum</name>
</author>
<author>
<name sortKey="Pereira, F" uniqKey="Pereira F">F Pereira</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Clark, S" uniqKey="Clark S">S Clark</name>
</author>
<author>
<name sortKey="Curran, Jr" uniqKey="Curran J">JR Curran</name>
</author>
<author>
<name sortKey="Osborne, M" uniqKey="Osborne M">M Osborne</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Loftsson, H" uniqKey="Loftsson H">H Loftsson</name>
</author>
<author>
<name sortKey="Helgad Ttir, S" uniqKey="Helgad Ttir S">S Helgadóttir</name>
</author>
<author>
<name sortKey="Rognvaldsson, E" uniqKey="Rognvaldsson E">E Rögnvaldsson</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Schmid, H" uniqKey="Schmid H">H Schmid</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ratnaparkhi, A" uniqKey="Ratnaparkhi A">A Ratnaparkhi</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Chatzis, Sp" uniqKey="Chatzis S">SP Chatzis</name>
</author>
<author>
<name sortKey="Demiris, Y" uniqKey="Demiris Y">Y Demiris</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Renooij, S" uniqKey="Renooij S">S Renooij</name>
</author>
<author>
<name sortKey="Van Der Gaag, Lc" uniqKey="Van Der Gaag L">LC Van Der Gaag</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Liu, P" uniqKey="Liu P">P Liu</name>
</author>
<author>
<name sortKey="Lei, L" uniqKey="Lei L">L Lei</name>
</author>
<author>
<name sortKey="Wu, N" uniqKey="Wu N">N Wu</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Rish, I" uniqKey="Rish I">I Rish</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Rabiner, L" uniqKey="Rabiner L">L Rabiner</name>
</author>
<author>
<name sortKey="Juang, Bh" uniqKey="Juang B">BH Juang</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Forney, G" uniqKey="Forney G">G Forney</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Tataru, P" uniqKey="Tataru P">P Tataru</name>
</author>
<author>
<name sortKey="Sand, A" uniqKey="Sand A">A Sand</name>
</author>
<author>
<name sortKey="Hobolth, A" uniqKey="Hobolth A">A Hobolth</name>
</author>
<author>
<name sortKey="Mailund, T" uniqKey="Mailund T">T Mailund</name>
</author>
<author>
<name sortKey="Pedersen, Cns" uniqKey="Pedersen C">CNS Pedersen</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ringger, E" uniqKey="Ringger E">E Ringger</name>
</author>
<author>
<name sortKey="Mcclanahan, P" uniqKey="Mcclanahan P">P McClanahan</name>
</author>
<author>
<name sortKey="Haertel, R" uniqKey="Haertel R">R Haertel</name>
</author>
<author>
<name sortKey="Busby, G" uniqKey="Busby G">G Busby</name>
</author>
<author>
<name sortKey="Carmen, M" uniqKey="Carmen M">M Carmen</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<affiliations>
<list>
<country>
<li>République populaire de Chine</li>
<li>États-Unis</li>
</country>
<region>
<li>Californie</li>
</region>
</list>
<tree>
<country name="États-Unis">
<region name="Californie">
<name sortKey="Urban, Kryztof" sort="Urban, Kryztof" uniqKey="Urban K" first="Kryztof" last="Urban">Kryztof Urban</name>
</region>
<name sortKey="Broadwell, Peter M" sort="Broadwell, Peter M" uniqKey="Broadwell P" first="Peter M." last="Broadwell">Peter M. Broadwell</name>
<name sortKey="Tangherlini, Timothy R" sort="Tangherlini, Timothy R" uniqKey="Tangherlini T" first="Timothy R." last="Tangherlini">Timothy R. Tangherlini</name>
</country>
<country name="République populaire de Chine">
<noRegion>
<name sortKey="Vij Nas, Aurelijus" sort="Vij Nas, Aurelijus" uniqKey="Vij Nas A" first="Aurelijus" last="Vij Nas">Aurelijus Vij Nas</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000086 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000086 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     PMC:4100772
   |texte=   Semi-Supervised Morphosyntactic Classification of Old Icelandic
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Exploration/RBID.i   -Sk "pubmed:25029462" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd   \
       | NlmPubMed2Wicri -a OcrV1 

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024